gh-128013: fix data race in PyUnicode_AsUTF8AndSize on free-threading by kumaraditya303 · Pull Request #128021 · python/cpython

kumaraditya303 · 2024-12-17T10:48:59Z

Issue: Data race in PyUnicode_AsUTF8AndSize under free-threading #128013

colesbury

I think there's still a bug where PyUnicode_UTF8() is checked outside the lock, but the condition may change once the lock is acquired (because some other thread filled in the utf8 field).

I think we should refactor out the check into something like unicode_ensure_utf8 that does the double-checked locking:

static int
unicode_ensure_utf8(PyObject *unicode)
{
    int err = 0;
    if (PyUnicode_UTF8(unicode) == NULL) {
        Py_BEGIN_CRITICAL_SECTION(unicode);
        if (PyUnicode_UTF8(unicode) == NULL) {
            err = unicode_fill_utf8(unicode);
        }
        Py_END_CRITICAL_SECTION();
    }
    return err;
}

unicode_fill_utf8 should assert that the critical section is held.

vstinner · 2024-12-18T09:13:39Z

I wrote PR gh-128061 "Convert unicodeobject.c macros to functions" to prepare the code for this change.

kumaraditya303 · 2024-12-18T16:14:51Z

I have updated the PR to use the new static inline functions and it now uses acquire/release semantics for utf8 member. I have tested the reproducer from issue and now there aren't any data races AFAICS.

colesbury

Overall looks good, I think there's just one issue in _PyUnicode_CheckConsistency.

vstinner

LGTM

bedevere-bot · 2024-12-19T13:27:29Z

⚠️⚠️⚠️ Buildbot failure ⚠️⚠️⚠️

Hi! The buildbot AMD64 CentOS9 NoGIL Refleaks 3.x has failed when building commit 3c168f7.

What do you need to do:

Don't panic.
Check the buildbot page in the devguide if you don't know what the buildbots are or how they work.
Go to the page of the buildbot that failed (https://buildbot.python.org/#/builders/1610/builds/545) and take a look at the build logs.
Check if the failure is related to this commit (3c168f7) or if it is a false positive.
If the failure is related to this commit, please, reflect that on the issue and make a new Pull Request with a fix.

You can take a look at the buildbot page here:

https://buildbot.python.org/#/builders/1610/builds/545

Failed tests:

test_free_threading

Summary of the results of the build (if available):

==

Click to see traceback logs

remote: Enumerating objects: 15, done.        
remote: Counting objects:   6% (1/15)        
remote: Counting objects:  13% (2/15)        
remote: Counting objects:  20% (3/15)        
remote: Counting objects:  26% (4/15)        
remote: Counting objects:  33% (5/15)        
remote: Counting objects:  40% (6/15)        
remote: Counting objects:  46% (7/15)        
remote: Counting objects:  53% (8/15)        
remote: Counting objects:  60% (9/15)        
remote: Counting objects:  66% (10/15)        
remote: Counting objects:  73% (11/15)        
remote: Counting objects:  80% (12/15)        
remote: Counting objects:  86% (13/15)        
remote: Counting objects:  93% (14/15)        
remote: Counting objects: 100% (15/15)        
remote: Counting objects: 100% (15/15), done.        
remote: Compressing objects:  14% (1/7)        
remote: Compressing objects:  28% (2/7)        
remote: Compressing objects:  42% (3/7)        
remote: Compressing objects:  57% (4/7)        
remote: Compressing objects:  71% (5/7)        
remote: Compressing objects:  85% (6/7)        
remote: Compressing objects: 100% (7/7)        
remote: Compressing objects: 100% (7/7), done.        
remote: Total 8 (delta 7), reused 1 (delta 1), pack-reused 0 (from 0)        
From https://github.com/python/cpython
 * branch                    main       -> FETCH_HEAD
Note: switching to '3c168f7f79d1da2323d35dcf88c2d3c8730e5df6'.

You are in 'detached HEAD' state. You can look around, make experimental
changes and commit them, and you can discard any commits you make in this
state without impacting any branches by switching back to a branch.

If you want to create a new branch to retain commits you create, you may
do so (now or later) by using -c with the switch command. Example:

  git switch -c <new-branch-name>

Or undo this operation with:

  git switch -

Turn off this advice by setting config variable advice.detachedHead to false

HEAD is now at 3c168f7f79d gh-128013: fix data race in `PyUnicode_AsUTF8AndSize` on free-threading (#128021)
Switched to and reset branch 'main'

configure: WARNING: no system libmpdecimal found; falling back to bundled libmpdecimal (deprecated and scheduled for removal in Python 3.15)

make: *** [Makefile:2321: buildbottest] Error 2

vstinner · 2024-12-19T14:14:45Z

The buildbot AMD64 CentOS9 NoGIL Refleaks 3.x has failed when building commit 3c168f7.

The failure looks unrelated:

0:12:32 load avg: 8.03 [466/481/1] test_free_threading worker non-zero exit code (Exit code -6 (SIGABRT)) -- running (3): (...)

(...)

Races assigning to __dict__ should be thread safe ...

python: Objects/obmalloc.c:1219: process_queue: Assertion `buf->rd_idx == buf->wr_idx' failed.
Fatal Python error: Aborted

Thread 0x00007fbb557fa640 (most recent call first):
  File "/home/buildbot/buildarea/3.x.itamaro-centos-aws.refleak.nogil/build/Lib/test/test_free_threading/test_dict.py", line 162 in writer_func
  File "/home/buildbot/buildarea/3.x.itamaro-centos-aws.refleak.nogil/build/Lib/threading.py", line 996 in run
  File "/home/buildbot/buildarea/3.x.itamaro-centos-aws.refleak.nogil/build/Lib/threading.py", line 1054 in _bootstrap_inner
  File "/home/buildbot/buildarea/3.x.itamaro-centos-aws.refleak.nogil/build/Lib/threading.py", line 1016 in _bootstrap

Current thread 0x00007fbb55ffb640 (most recent call first):
  Garbage-collecting
  File "/home/buildbot/buildarea/3.x.itamaro-centos-aws.refleak.nogil/build/Lib/test/test_free_threading/test_dict.py", line 163 in writer_func
  File "/home/buildbot/buildarea/3.x.itamaro-centos-aws.refleak.nogil/build/Lib/threading.py", line 996 in run
  File "/home/buildbot/buildarea/3.x.itamaro-centos-aws.refleak.nogil/build/Lib/threading.py", line 1054 in _bootstrap_inner
  File "/home/buildbot/buildarea/3.x.itamaro-centos-aws.refleak.nogil/build/Lib/threading.py", line 1016 in _bootstrap

Thread 0x00007fbb567fc640 (most recent call first):
  File "/home/buildbot/buildarea/3.x.itamaro-centos-aws.refleak.nogil/build/Lib/test/test_free_threading/test_dict.py", line 162 in writer_func
  File "/home/buildbot/buildarea/3.x.itamaro-centos-aws.refleak.nogil/build/Lib/threading.py", line 996 in run
  File "/home/buildbot/buildarea/3.x.itamaro-centos-aws.refleak.nogil/build/Lib/threading.py", line 1054 in _bootstrap_inner
  File "/home/buildbot/buildarea/3.x.itamaro-centos-aws.refleak.nogil/build/Lib/threading.py", line 1016 in _bootstrap

Thread 0x00007fbb56ffd640 (most recent call first):
  File "/home/buildbot/buildarea/3.x.itamaro-centos-aws.refleak.nogil/build/Lib/test/test_free_threading/test_dict.py", line 158 in writer_func
  File "/home/buildbot/buildarea/3.x.itamaro-centos-aws.refleak.nogil/build/Lib/threading.py", line 996 in run
  File "/home/buildbot/buildarea/3.x.itamaro-centos-aws.refleak.nogil/build/Lib/threading.py", line 1054 in _bootstrap_inner
  File "/home/buildbot/buildarea/3.x.itamaro-centos-aws.refleak.nogil/build/Lib/threading.py", line 1016 in _bootstrap

Thread 0x00007fbb577fe640 (most recent call first):
  File "/home/buildbot/buildarea/3.x.itamaro-centos-aws.refleak.nogil/build/Lib/test/test_free_threading/test_dict.py", line 158 in writer_func
  File "/home/buildbot/buildarea/3.x.itamaro-centos-aws.refleak.nogil/build/Lib/threading.py", line 996 in run
  File "/home/buildbot/buildarea/3.x.itamaro-centos-aws.refleak.nogil/build/Lib/threading.py", line 1054 in _bootstrap_inner
  File "/home/buildbot/buildarea/3.x.itamaro-centos-aws.refleak.nogil/build/Lib/threading.py", line 1016 in _bootstrap

Thread 0x00007fbb57fff640 (most recent call first):
  File "/home/buildbot/buildarea/3.x.itamaro-centos-aws.refleak.nogil/build/Lib/test/test_free_threading/test_dict.py", line 162 in writer_func
  File "/home/buildbot/buildarea/3.x.itamaro-centos-aws.refleak.nogil/build/Lib/threading.py", line 996 in run
  File "/home/buildbot/buildarea/3.x.itamaro-centos-aws.refleak.nogil/build/Lib/threading.py", line 1054 in _bootstrap_inner
  File "/home/buildbot/buildarea/3.x.itamaro-centos-aws.refleak.nogil/build/Lib/threading.py", line 1016 in _bootstrap

Thread 0x00007fbb16ffd640 (most recent call first):
  File "/home/buildbot/buildarea/3.x.itamaro-centos-aws.refleak.nogil/build/Lib/test/test_free_threading/test_dict.py", line 158 in writer_func
  File "/home/buildbot/buildarea/3.x.itamaro-centos-aws.refleak.nogil/build/Lib/threading.py", line 996 in run
  File "/home/buildbot/buildarea/3.x.itamaro-centos-aws.refleak.nogil/build/Lib/threading.py", line 1054 in _bootstrap_inner
  File "/home/buildbot/buildarea/3.x.itamaro-centos-aws.refleak.nogil/build/Lib/threading.py", line 1016 in _bootstrap

Thread 0x00007fbb15ffb640 (most recent call first):
  File "/home/buildbot/buildarea/3.x.itamaro-centos-aws.refleak.nogil/build/Lib/test/test_free_threading/test_dict.py", line 158 in writer_func
  File "/home/buildbot/buildarea/3.x.itamaro-centos-aws.refleak.nogil/build/Lib/threading.py", line 996 in run
  File "/home/buildbot/buildarea/3.x.itamaro-centos-aws.refleak.nogil/build/Lib/threading.py", line 1054 in _bootstrap_inner
  File "/home/buildbot/buildarea/3.x.itamaro-centos-aws.refleak.nogil/build/Lib/threading.py", line 1016 in _bootstrap

Thread 0x00007fbb5d092740 (most recent call first):
  File "/home/buildbot/buildarea/3.x.itamaro-centos-aws.refleak.nogil/build/Lib/threading.py", line 1105 in join
  File "/home/buildbot/buildarea/3.x.itamaro-centos-aws.refleak.nogil/build/Lib/test/test_free_threading/test_dict.py", line 188 in test_racing_set_object_dict
  File "/home/buildbot/buildarea/3.x.itamaro-centos-aws.refleak.nogil/build/Lib/unittest/case.py", line 606 in _callTestMethod
  File "/home/buildbot/buildarea/3.x.itamaro-centos-aws.refleak.nogil/build/Lib/unittest/case.py", line 660 in run
  File "/home/buildbot/buildarea/3.x.itamaro-centos-aws.refleak.nogil/build/Lib/unittest/case.py", line 716 in __call__
  File "/home/buildbot/buildarea/3.x.itamaro-centos-aws.refleak.nogil/build/Lib/unittest/suite.py", line 122 in run
  File "/home/buildbot/buildarea/3.x.itamaro-centos-aws.refleak.nogil/build/Lib/unittest/suite.py", line 84 in __call__
  File "/home/buildbot/buildarea/3.x.itamaro-centos-aws.refleak.nogil/build/Lib/unittest/suite.py", line 122 in run
  File "/home/buildbot/buildarea/3.x.itamaro-centos-aws.refleak.nogil/build/Lib/unittest/suite.py", line 84 in __call__
  File "/home/buildbot/buildarea/3.x.itamaro-centos-aws.refleak.nogil/build/Lib/unittest/suite.py", line 122 in run
  File "/home/buildbot/buildarea/3.x.itamaro-centos-aws.refleak.nogil/build/Lib/unittest/suite.py", line 84 in __call__
  File "/home/buildbot/buildarea/3.x.itamaro-centos-aws.refleak.nogil/build/Lib/unittest/suite.py", line 122 in run
  File "/home/buildbot/buildarea/3.x.itamaro-centos-aws.refleak.nogil/build/Lib/unittest/suite.py", line 84 in __call__
  File "/home/buildbot/buildarea/3.x.itamaro-centos-aws.refleak.nogil/build/Lib/unittest/runner.py", line 259 in run
  File "/home/buildbot/buildarea/3.x.itamaro-centos-aws.refleak.nogil/build/Lib/test/libregrtest/single.py", line 58 in _run_suite
  File "/home/buildbot/buildarea/3.x.itamaro-centos-aws.refleak.nogil/build/Lib/test/libregrtest/single.py", line 38 in run_unittest
  File "/home/buildbot/buildarea/3.x.itamaro-centos-aws.refleak.nogil/build/Lib/test/libregrtest/single.py", line 136 in test_func
  File "/home/buildbot/buildarea/3.x.itamaro-centos-aws.refleak.nogil/build/Lib/test/libregrtest/refleak.py", line 132 in runtest_refleak
  File "/home/buildbot/buildarea/3.x.itamaro-centos-aws.refleak.nogil/build/Lib/test/libregrtest/single.py", line 88 in regrtest_runner
  File "/home/buildbot/buildarea/3.x.itamaro-centos-aws.refleak.nogil/build/Lib/test/libregrtest/single.py", line 139 in _load_run_test
  File "/home/buildbot/buildarea/3.x.itamaro-centos-aws.refleak.nogil/build/Lib/test/libregrtest/single.py", line 184 in _runtest_env_changed_exc
  File "/home/buildbot/buildarea/3.x.itamaro-centos-aws.refleak.nogil/build/Lib/test/libregrtest/single.py", line 284 in _runtest
  File "/home/buildbot/buildarea/3.x.itamaro-centos-aws.refleak.nogil/build/Lib/test/libregrtest/single.py", line 313 in run_single_test
  File "/home/buildbot/buildarea/3.x.itamaro-centos-aws.refleak.nogil/build/Lib/test/libregrtest/worker.py", line 83 in worker_process
  File "/home/buildbot/buildarea/3.x.itamaro-centos-aws.refleak.nogil/build/Lib/test/libregrtest/worker.py", line 118 in main
  File "/home/buildbot/buildarea/3.x.itamaro-centos-aws.refleak.nogil/build/Lib/test/libregrtest/worker.py", line 122 in <module>
  File "/home/buildbot/buildarea/3.x.itamaro-centos-aws.refleak.nogil/build/Lib/runpy.py", line 88 in _run_code
  File "/home/buildbot/buildarea/3.x.itamaro-centos-aws.refleak.nogil/build/Lib/runpy.py", line 198 in _run_module_as_main

Extension modules: _testcapi (total: 1)

…hreading (python#128021)

bedevere-app · 2025-01-02T14:42:45Z

GH-128417 is a backport of this pull request to the 3.13 branch.

…reading (#128021) (#128417)

…hreading (python#128021)

kumaraditya303 changed the title ~~gh128013: fix data race in PyUnicode_AsUTF8AndSize on free-threading~~ gh-128013: fix data race in PyUnicode_AsUTF8AndSize on free-threading Dec 17, 2024

bedevere-app Bot mentioned this pull request Dec 17, 2024

Data race in PyUnicode_AsUTF8AndSize under free-threading #128013

Closed

kumaraditya303 force-pushed the utf8 branch 2 times, most recently from 87bdaea to 0f692ce Compare December 17, 2024 12:35

vstinner reviewed Dec 17, 2024

View reviewed changes

Comment thread Objects/unicodeobject.c Outdated

Comment thread Lib/test/test_capi/test_unicode.py Outdated

Comment thread Lib/test/test_capi/test_unicode.py Outdated

colesbury self-requested a review December 17, 2024 15:30

colesbury reviewed Dec 17, 2024

View reviewed changes

Comment thread Objects/unicodeobject.c Outdated

Comment thread Objects/unicodeobject.c Outdated

kumaraditya303 marked this pull request as ready for review December 18, 2024 15:39

bedevere-app Bot added the awaiting core review label Dec 18, 2024

fix race

587f9d6

kumaraditya303 force-pushed the utf8 branch from 0f692ce to 587f9d6 Compare December 18, 2024 16:12

add newline

d829ed0

kumaraditya303 added the skip news label Dec 18, 2024

kumaraditya303 requested a review from colesbury December 18, 2024 16:28

colesbury reviewed Dec 18, 2024

View reviewed changes

Comment thread Objects/unicodeobject.c

Comment thread Objects/unicodeobject.c

fix _PyUnicode_CheckConsistency

5fd952e

vstinner approved these changes Dec 19, 2024

View reviewed changes

bedevere-app Bot added awaiting merge and removed awaiting core review labels Dec 19, 2024

Merge branch 'main' into utf8

756085e

kumaraditya303 merged commit 3c168f7 into python:main Dec 19, 2024

bedevere-app Bot removed the awaiting merge label Dec 19, 2024

kumaraditya303 deleted the utf8 branch December 19, 2024 11:38

srinivasreddy pushed a commit to srinivasreddy/cpython that referenced this pull request Dec 23, 2024

pythongh-128013: fix data race in PyUnicode_AsUTF8AndSize on free-t…

6f3f0d4

…hreading (python#128021)

kumaraditya303 added a commit to kumaraditya303/cpython that referenced this pull request Jan 2, 2025

pythongh-128013: fix data race in PyUnicode_AsUTF8AndSize on free-t…

5d413ec

…hreading (python#128021)

kumaraditya303 added a commit that referenced this pull request Jan 2, 2025

[3.13] gh-128013: fix data race in PyUnicode_AsUTF8AndSize on free-th…

fa6c48e

…reading (#128021) (#128417)

srinivasreddy pushed a commit to srinivasreddy/cpython that referenced this pull request Jan 8, 2025

pythongh-128013: fix data race in PyUnicode_AsUTF8AndSize on free-t…

cc619b3

…hreading (python#128021)

kumaraditya303 added the topic-free-threading label Jun 23, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

gh-128013: fix data race in PyUnicode_AsUTF8AndSize on free-threading#128021

gh-128013: fix data race in PyUnicode_AsUTF8AndSize on free-threading#128021
kumaraditya303 merged 4 commits into
python:mainfrom
kumaraditya303:utf8

kumaraditya303 commented Dec 17, 2024 •

edited by bedevere-app Bot

Loading

Uh oh!

Uh oh!

Uh oh!

Uh oh!

colesbury left a comment

Uh oh!

Uh oh!

Uh oh!

vstinner commented Dec 18, 2024

Uh oh!

kumaraditya303 commented Dec 18, 2024

Uh oh!

colesbury left a comment

Uh oh!

Uh oh!

Uh oh!

vstinner left a comment

Uh oh!

bedevere-bot commented Dec 19, 2024

Uh oh!

vstinner commented Dec 19, 2024

Uh oh!

bedevere-app Bot commented Jan 2, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Uh oh!

Uh oh!

Conversation

kumaraditya303 commented Dec 17, 2024 • edited by bedevere-app Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

colesbury left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

vstinner commented Dec 18, 2024

Uh oh!

kumaraditya303 commented Dec 18, 2024

Uh oh!

colesbury left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

vstinner left a comment

Choose a reason for hiding this comment

Uh oh!

bedevere-bot commented Dec 19, 2024

⚠️⚠️⚠️ Buildbot failure ⚠️⚠️⚠️

Uh oh!

vstinner commented Dec 19, 2024

Uh oh!

bedevere-app Bot commented Jan 2, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

kumaraditya303 commented Dec 17, 2024 •

edited by bedevere-app Bot

Loading